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replica symmetry breaking (IRSB) ansatz. We derive a general criterion for the validity of this 
ansatz and, applying it to the ground state, we provide evidence that the IRSB solution gives 
exact threshold values Cq for the q-COL/UNCOL phase transition. We also study the asymptotic 
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mathematical bounds, as well as the nature of excited states, and give a global phase diagram of 
the problem. 
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I. INTRODUCTION 

The graph coloring problem (COL) has been studied both in combinatorics and in statistical physics ■ Given 
a graph, or a lattice, and given a number q of available colors, the problem consists in assigning a color to each vertex 
such that no edge has two equally colored end vertices. For a given graph, one quantity of interest is thereby the 
minimal number of colors needed, i.e. the so-called chromatic number. 

In this paper we are going to consider COL as applied to random graphs of fluctuating as well as of fixed connectivity. 
In fact, determining their chromatic number is one of the most fundamental open problems in random-graph theory 
0. It has attracted considerable interest also within the theoretical computer-science literature: COL is one of the 
basic NP-hard problems which form the very core of complexity theory Defined on random graphs, the problem 
shows interesting phase transitions at the so-called q-COL/UNCOL thresholds c,: Graphs of average connectivity 
c < Cq owe proper g-colorings with high probability (approaching one for graph size — > 00), whereas graphs of 
higher connectivity require more than q colors. This transition is connected to a pronounced peak in the numerical 
resolution time, i.e. in the time needed to either construct a g-coloring or to prove its non-existence. The hardest to 
solve problems are typically situated close to the phase boundary. 

One of the first important mathematical results for g-COL on Erdos-Renyi random graphs of average connectivity 
c was obtained by Luczak more than one decade ago 0. He showed in particular that, for a random graph of given 
finite average connectivity c, the chromatic number takes one out of only two possible consecutive values with high 
probability. Even if he was not yet able to determine these values, he showed that q colors are not sufficient for almost 
all graphs with c > 2glng — Ing — 1 + o(l). Rephrased in terms of the g-COL/UNCOL threshold Cq, he thus proved 
the upper bound < 2(7ln(7 — Ing — 1 + o(l). Very recently Achlioptas and Naor 6] put a rigorous lower bound on 
the threshold using the second moment method. They showed that Cq > 2qhiq — 2hiq + o(l), but their method in 
fact leads to an even better conjectured lower bound Cq > 2(7ln(7 — Ing — 2 -I- o(l) |3| which differs just by one from 
Luczak's upper bound. So, up to these small intervals, the exact and unique value of the chromatic number is known 
by now. Inside these intervals, even more powerful methods are needed to determine the chromatic number and thus 
the COL/UNCOL threshold. 

If one considers, on the other hand, the performance of linear-time algorithms, g-colorings can be easily constructed 
up to connectivity c ~ glng, i.e. only in roughly the first half of the colorable phase. It is simple to design algorithms 
working up to c = (1 — e)glnfl, for any e > 0, whereas no linear-time algorithm is known which works also for 
connectivity c = (1 + e)q\nq [3, 0, 0|. The very existence of linear algorithms working also beyond this point is 
considered as another major open question within the field. 

Recently, the problem has been reconsidered using tools borrowed from statistical mechanics of disordered systems 
[1^ IT^ . In this way both questions, i.e. the location of the g-COL/UNCOL threshold and the reason for the failure 
of linear-time algorithms well before this threshold, have further approached an answer, though not on completely 
rigorous grounds. Within the IRSB approach, the g-COL/UNCOL transition Cq can be determined for an arbitrary 
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number of colors q. Moreover, the IRSB approach predicts a connectivity region Cd < c < Cq inside the colorable 
phase, where solutions are non-trivially organized in clusters, an exponential number of metastable states and large 
energetic barriers exist. This clustering phenomenon - intuitively - causes local algorithms to get stuck, see for 
recent results. 

All these results were derived with the cavity method in IRSB approximation 15J which makes strong hypotheses 
on the phase space structure of the problem. The computation is further on heuristic in the sense that assumptions 
are only checked self-consistently, but its predictions are confirmed by independent numerical tests. Very recently 
a number of papers have put under scrutiny the clustering hypothesis in glass models on the Bethe lattice , in 
some combinatorial optimization problems like random if-XORSAT and iiT-SAT [itL fisl IT9| as well as in polymer 
problems |2(]| . They analyze the possibility of more complex patterns of clusterization due to local instabilities. 

In this work we investigate these instabilities for the coloring problem. We thereby show that a small part of the 
IRSB solution of |T3. [l3i] close to the onset of the clustered phase turns out to be unstable. Interestingly enough, 
however, the g-COL/UNCOL transition is in the stable region at any q, and thus the IRSB threshold results are 
expected to be exact. We will also analyze the stability of the coloring problem on fixed connectivity random graphs, 
which is somehow easier to deal with analytically. 

The outline of the paper is as follows: Sec. ^1 Properly defines the problem under investigation, and reviews the 
IRSB approach. In Sec. lIIII we set up the general formalism for the stability analysis. The most relevant consequences 
of this approach for a small number of available colors are then presented in Sec. II VI for the TYPE-I instability, and 
in Sec. for the one of TYPE-II. Sec. IVIl is devoted to the high q analysis of the model while, in Sec. lVIII we finally 
consider the problem at finite energies. Conclusions and perspectives are drawn in Sec. IVIIII 

II. THE MODEL AND ITS IRSB SOLUTION 
A. The graph coloring problem 

Let us start with a proper definition of the problem. We consider a graph G — {V,£) defined by its vertices 
V = {1,...,A^} and undirected edges € £ which connect pairs of vertices i,j G V. A graph q-coloring is a 

mapping a : V— s- {!,..., g} which assigns colors 1, g to all vertices, such that no edges are monochromatic. For all 
edges (i, j) e £ we have therefore ai ^ aj. 

Within the statistical-mechanics approach, a Hamiltonian is assigned to this problem such that all g-colorings are 
found as ground states. For any color assignment, i.e. {cTi} G {1, 2, . . . ,q} for all i G V, we therefore define 

(ij)e£ 

with S{-,-) denoting the Kronecker symbol. This Hamiltonian counts the number of monochromatically colored 
edges, a proper coloring of the graph thus has zero energy. In a physicist's language, the Hamiltonian describes an 
anti-ferromagnetic q-state Potts model on the graph G. 

The aim within the statistical mechanics approach is to study the ground state properties of this model: If the 
ground state energy equals zero, the graph is colorable. The ground-state entropy determines the number of colorings, 
and the order parameter, see below, characterizes the statistical properties of the ensemble of all solutions. If, on 
the other hand, the ground-state energy becomes positive, we know that there are no proper colorings. The graph is 
uncolorable with q colors. 

B. Erdos-Renyi random graphs and regular random graphs 

We consider the graph coloring problem on two different random-graph ensembles. 

The first one is the ensemble Q{N,c/{N — 1)) first introduced by Erdos and Renyi in the late 1950s 0. A graph 
from this ensemble consists of N vertices j = 1, ...,N. Between each pair i,j of vertices, with i < j, an undirected 
edge is drawn randomly and independently with probability c/{N — 1). The vertices remains unconnected by a direct 
edge with probability 1 — c/ (A'' — 1). 

Here we are mainly interested in the thermodynamic limit N oo, i.e. we describe large graphs of finite c. The 
average vertex degree, which equals the expected number of edges incident to an arbitrary vertex, is easily calculated 
as {N — 1) • c/{N — 1) = c, and it remains finite in the large- A^ limit. There are, however, degree fluctuations for every 
finite c. In fact, in the thermodynamic limit, the probability that a randomly selected vertex has degree d, is given 
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by the Poissonian distribution 

P. = e-=^ (2) 

of mean c. Another crucial point for our analysis is that, for finite c, the number of triangles or other short loops in 
the graph remains finite in the large- iV limit. This means that the graph is almost everywhere locally tree-like, i.e. 
on finite length scales it looks like a tree. For c > 1, there exists an extensive number of loops. These have, however, 
length ©(In TV), and become infinitely long for — > oo. 

The second ensemble is denoted by Qc{N), and contains all c-regular graphs of N vertices, where c has to be a 
positive integer in this case. A graph is called c-regular if and only if all vertices have the same degree c, i.e. here we 
have 

Pd = 5{d,c) . (3) 

A random regular graph is one randomly selected element of this ensemble. This guarantees again that the graph 
becomes locally tree-like. Note that due to the constant vertex degree, these graphs look locally homogeneous, on 
finite length scales they do not show any disorder. The random character of regular graphs enters only via the long 
loops which are again of length ©(In A). In the statistical physics literature, in particular in the theory of disordered 
and glassy systems, random graphs are considered as one valid definition of a Bethe lattice. 

In a slightly more general context, both randor n graph ensembles defined before can be embedded into the ensemble 
of random graphs with given degree distribution |2lll22|. For these graphs, only pd is defined, and each graph having 
the desired degree distribution is considered to be equiprobable. In this sense, all results formulated below can be 
directly generalized to arbitrary degree distributions. Sometimes the formulation will even be given in this context, 
but the concrete analysis will be restricted to Poissonian and regular graphs. 

For this generalized ensemble, we still introduce the probability that an arbitrary end-vertex of a randomly 
selected edge has excess degree d, i.e. it is contained in d supplementary edges, and its total vertex degree is c? -I- 1. 
Given the degree distribution, this probability results as 

,^^id+l)pd+, (4) 

Plugging in our special cases, we see that this distribution remains Poissonian for Erdos-Renyi graphs, whereas the 
excess degrees equal constantly c — 1 in the case of regular graphs. 



C. Survey propagation equations for g-coloring 

The cavity equations for finite-connectivity systems in the one-step replica-symmetry broken approximation have 
been originally derived in ;15t 1231] . Their single sample version is usually called survey propagation (SP) and has 
been introduced in [23. l25j | in the case of random 3-SAT. Here, in order to fix the notation, we will briefiy recall the 
SP equations on the IRSB level for q-COL closely following Note, however, that a complete explanation 

of all technical details is not the scope of this paper, for a detailed presentation please see therefore the original 
publications 

The zero-temperature properties of the system (or local minima of Hq) can be completely characterized by the 
edge-dependent probability distributions £ £)'■ 

Qr^j{u) = ri'l^jdiu) + vUjSiu + er) (5) 

where the vectors {ei, • • • ,6^} form the usual g-dimensional canonical Euclidean base set, with components = 
5(r, cr). The rji^jS are positively defined probabilities, normalization implies X]r=o'?i-^j ~ distribution 
Qi^j(u) is called a survey, and it describes the probability that, in a suitably chosen metastable state, or local 
minimum of Hq, a warning u is send from vertex i via the edge («, j) to vertex j. Possible warnings are the vectors 
— eV which include a warning that assigning color r to vertex i will cause an energy increase, and the zero message 
M = (0, 0). In the following, warnings will frequently denoted simply by their indices r = 0, g. 
These distributions are self-consistently determined via the SP equations 



Pi-,j{h) = Ci^j / 1^ J]^ d'^UkQk^iiuk) 6 ih- ^ Uk \ exp < y ^ Ufe > (6) 

kev{t)\j \ kev{^)\j J [ \kev{t)\j J J 

Q^^J{u) = f dVi P,^j{h)s(u-u{h)^ . (7) 
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There the Ci^j are normahzation constants, the set V{i) C £ contains all neighbors of vertex i, and the functions u 
and oj are defined as: 



uj{h) = -min(-/i\...,-/i«) 

[h] = Lu{h — Cr) — Lu{h) . 



(8) 



These equations have a very nice interpretation: A site i receives an incoming field h as a sum of all but one incoming 
warnings. This field has either zero or negative entries, refiecting the anti-ferromagnetic character of the interaction. 
The maximal field components determine the colors of minimal energy if assigned to i. If this maximal field component 
is unique, a non-trivial message "Do not take this unique color!" is sent from vertex i to j via the last link. If the 
maximal field component is degenerate, the zero message is sent via link i j. 

Note the appearance of the reweighting parameter y > 0, which allows to scan metastable states of different energies. 
It acts similar to the inverse temperature in the usual Boltzmann weight: Different y concentrate the measure on 
different energy levels, and the limit y —> oo corresponds to zero-energy ground states. 

The corresponding y-dependent free energy can be calculated as a sum of node and link contributions, 



1 

N 



E (2/)-E('^^-i)'^"°'^(y) 



where di is the degree of vertex i. The expressions for (fi".^^{y) and (j>"°'^'^{y) are given explicitly by 



(9) 



0':f (y) 



In ( / d''hPi^j{h) <PuQj^i{u) exp|-j/ i^{h) - ui{h + u) | 



and by 



inode 



(2/) = -ilnjy H d'^UkQ 
y kev(i) 



k^t{uk) cxp ■( yuj \ Uk 

kev{i} 



From this free energy we can easily calculate both the complexity and the energy density e{y), 



-y 



> d^iy) 

dy 



e(y) 



dy 



(10) 



(11) 



(12) 



where the complexity is defined as the logarithm of the number of metastable states of energy e(y), divided by the 
number N of vertices. 

Proper colorings are characterized by the limit ?/ — > oo of the SP Eqs. (|6I7|I . In this case, positive energy contributions 
are forbidden and we work directly at zero energy. The SP equations can be brought into a much more handy form: 
They are reduced to the parameters Vi~>j7 i-^- to the probabilities to have an anti-ferromagnetic message for color r 
sent from vertex i to vertex j. Using the vectorial notation 



, kdi-i} all neighbors of i different from j, we have the closed iteration description 



(13) 



and denoting by V{i)\j — {ki, fc2. 



Vi^j ^ fd,-i{Vki^t,rfk2^i, ■ . . ,fjk^^_.^^i) (14) 

given component-wise by 

nfcgy(i)\j(l ~ Vk^i) ~ J^Ti^r nfcgv(i)\j(l ~ Vl^i ~ Vk'-^i) ^ ^ (-1)'' ^ 'n.kev(i)\j ^k^i 

Sl<Ti<g Y\keV{i)\ji^ ~ "Hk-^i) ~ X]l<Ti<r2<(3 Y\keV[i)\ii^ ~ Vk-^i ~ 'Hk-^i) + ' ' ' + (^l)"* ^ Y\k^V ''Ik-ti 

(15) 

for all T € {1, . . . , g}. The value of rjf^j can be calculated from the normalization constraint. 

The above formalism is formulated for the analysis of a single (tree-like) graph, but it can be easily modified in 
order to deal with average quantities on the random-graph ensembles Q{N^cJ(N_— 1)) or Oc{N), see Sec. Ill Bl General 
considerations on the existence of a well defined thermodynamic limit |26l l27l | imply the existence of a functional 
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probability distribution Q[Q{u)] describing how the surveys Q{u) are distributed on the edges of the graph. Noting 
that a g-component vector fj is sufficient to describe a survey Qi^j, we can explicitly write Q[Q{u)] as 



(Pf^ p{ff) s 



Q{u) - {I -j^rilSiu) 



T = l 



T = l 



(16) 



in terms of a simple g-dimensional probability distribution p{ff), with 5[-] denoting a functional Dirac distribution. 
The SP equations H6I7I) have to be interpreted in a probabilistic way: Drawing first an excess degree d with probability 
Td, cf. Eq. 0), and then d independently chosen surveys Qi{u), I — l,...,d, from Q[Q], we calculate 



/d 
1=1 



d^h Poih) S{u-u{h)) . 



(17) 
(18) 



The cavity equation for the functional distribution of surveys closes by the observation that the newly generated 
Qoiu) has to be again a typical survey drawn from Q[Q{u)]- 



Concentrating on the color-symmetric situation ?] 



dimensional piji). The limit y — > cx3 of the cavity equations is readily obtained: 

oo „ 

Piv) = '^'rd / drji pirn) ■ ■ ■ drjd KVd) S{r] - fd{rii, . . . ,7u)) 



77*^, the distribution p{7i) is reduced to a one- 

(19) 



d=0 



with 



fdivi 



,Vd) 



Ero(-i)'(V)nti[i-(^+ih^] 



It is also possible to give a closed expression for the complexity in the COL region (notice the pd instead of r^) 



(20) 



S(y = oo) = dryip(r?i)...dr;rfp(,?,)ln ^(-l)M J m[l-(^ + lH] 

d=i •' \(=o ^ i=l 



d=l 
C 

2 



drji p{r]i) dT]2 pir]2) In (1 - 9771772) . 



(21) 



D. The qualitative IRSB picture 

The formalism summarized above allows to determine not only the location of the g-COL/UNCOL transition 
for every q, but the order parameter allows also to extract important statistical information about structure and 
organization of the solutions. 

The most interesting point here concerns the set of solutions, seen as a subset of all possible configurations 
{1, q}^ ■ Every edge of the graph forbids a certain number of configurations. For small c, there is, however, still 
an exponentially large number of proper colorings. In addition, these are organized in a very simple way. They are 
collected in one very large cluster. For any two solutions, one can find a connecting path via other solutions, without 
ever changing more than 0(1) spins within one step of this path. 

This changes drastically at some average degree Cd- The set of solutions is still exponentially large, but it is split 
into an also exponential number of clusters. Inside each cluster, connecting paths as described above still exist, but 
any two clusters have an extensive Hamming distance from each other, cf. also Fig. ^ Technically the clustering 
threshold Cd is given by the first appearance of a non-trivial IRSB solution. The number of clusters, or more precisely 
its logarithm divided by N , is given by the complexity H21|l mentioned in the last sub-section. 

Inside these clusters, the first freezing phenomena are found. A large fraction of all vertices is frozen to one color 
in all solutions belonging to the same cluster. This color changes, of course, from cluster to cluster, since the initial 
model is color symmetric. Together with this clustering of ground-states, also an exponential number of metastable 
states appears. These have non-zero energy, but they are local minima of Hamiltonian 

The complexity of solution clusters decreases, until it vanishes at Cq. Beyond this point, the graph is uncolorable. 
There are still numerous ground state clusters, which have, however, non-zero energy but zero complexity. 
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FIG. 1: A pictorial view of the solution space structure. For small average degree c, all solutions (marked by full dots) are 
collected within one large cluster. At some transition point Cd, this cluster breaks down into an exponential number of separated 
clusters. Beyond Cq, i.e. in the UNCOL phase, there are still distinct ground state clusters, but they have non-zero energy 
(marked by empty dots). 



III. STABILITY CONDITIONS OF THE ONE-STEP RSB SOLUTION 



There is a growing believe by now that the cavity method gives exact results and not just approximations - provided 
that the replica symmetry is broken in the correct way. Even if a rigorous general proof is still lacking, a number of 
steps forward have been made so far in this direction. On this basis we conjecture that the results concerning the 
colorability threshold, as well as many features of the phase diagram (like for instance the existence of the clustering 
phase) are exact. 

In general however, the one-step RSB solution of such disordered models has no particular reason to be correct. In 
some models, the replica symmetry has to be broken infinitely many times to reach the exact solution. In the language 
of the cavity method one should thus consider an infinite hierarchy of nested clusters. This happens for instance in 
the Sherrington-Kirkpatrick (SK) model [2^, and Talagrand has recently demonstrated rigorously that the RSB free 
energy obtained in this way is exact p9l |. On the other hand there are models where IRSB is not an approximation, 
and no further steps of symmetry breaking are needed. This happens, e.g., in the random-energy model js^l and in 
the K-XORSAT problem [sil l3?| (a problem known in statistical physics as the diluted p-spin model). There the 
IRSB solution is known to be rigorously exact. 

Here we are going to determine whether, in the q-coloring problem, one step of RSB is sufficient or whether more 
steps have to be taken into account to get the correct solution. We do this by means of analyzing the stability of the 
IRSB solution against further RSB steps. To be more precise, we should formulate a 2RSB solution of the model and 
see if the IRSB solution is stable against a small 2RSB perturbation js^. This type of local stability analysis has 
receive a lot of interest recently, extending the seminal work of Elisabeth Gardner |3J| to finite connectivity spin glass 
models . It was clarified a lot in fli^pS l , and a formalism to deal with more general finite-connectivity problems 
has been established by now [T^ llTl Il9||. The coloring problem, as considered here, will allow for nice analytical 
treatments, in particular if considered on regular graphs. 

Let us rephrase the stability considerations for our problem, following the notation in Using Eq. Hl()|) the 

cavity equations for general y (|17I18|I can be rewritten in terms of the probabilities ,t = 0, q only. Formally this 
results in 

TyJ^Co e^-('^---'^) (22) 

(CTi,...,crd)^r 

where the sum runs over all possible combinations of input messages, < ai,...,<7d < Q, which induce the output 
message labeled by r. Remember that the special case y = 00 which concentrates on ground states, was given 
exphcitly in Eq. ifT^ . 

The IRSB solution is given by a color-symmetric distribution /5(r/), describing ry-fluctuations from link to link. 
Let us, for a moment, consider an arbitrary edge i — > j (together with a direction) which is characterized by one 
single y-dependent value of 77. In the IRSB formalism, there are many (meta)-stable states. In each of them, this 
link i j carries exactly one warning corresponding either to one of the colors, or being the zero message. For a 
randomly selected state of energy density e(i/) a specific warning is found with color-independent probability 77, the 
trivial message appears in a fraction 1 — g?7 of these states. This is schematically represented on the left side of Fig|21 

Let us now consider two steps of RSB. There we need to take into account the existence of clusters of states. 
Regarding small perturbations of the IRSB solution, two situations are possible [T8l |: 
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FIG. 2: A pictorial view of the two types of instabilities of the IRSB solution, represented here for a given link i — > j in the 
3-coloring problem. In each IRSB states (on the left), this link carries a message of a given color, or no message at all (white 
clusters here). Going from a IRSB to 2RSB, these states may aggregate into bigger clusters (TYPE-I instability) or fragment 
in new smaller clusters (TYPE-II instability). 



• TYPE-I: States may aggregate in configuration space, see Fig|21 In that case the order parameter (still for a 
fixed hnk i j) is the probability (?(?7^, ■•.,'7*) to randomly select a cluster in which there is a fraction i]^ of 
states with message — ei transmitted from i to j etc. Note that it is not sufficient to consider a probability 
distribution g{r]) of a single, color-symmetric fraction rj since the color symmetry can be broken inside a cluster. 
This is illustrated in Fig. |21 To study the stability of the IRSB solution, one should thus write the 2RSB 
equation, insert a perturbed IRSB solution and see if it evolves back to IRSB. In this case, the IRSB solution 
is obtain by considering g{r]^j ■■■jV'^) = ^iv^ ~ v) ' ' ■ Sij]'^ — rf). To test stability, we thus need to replace the 
Dirac peaks by narrow functions of width e, and to see if the width increases or goes to zero during the cavity 
iteration process. The analysis thus amounts to look if a small noise added to the solution will vanish. This is 
equivalent to testing convergence of the SP equations on a single graph. We will refer to this kind of analysis 
as noise propagation. 

• TYPE-II: States may fragment into new states, see the lower right picture in Fig|21 In the IRSB solution, a 
link i j carries a message which is uniquely determined within one state. If now this state fragments into 
one cluster of many states, also the single message is transformed into a set of messages. To introduce a small 
perturbation of the IRSB solution, we should assume that, with high probability, the states within one cluster 
are still characterized by the same message on link i — > j. In a small fraction of these states, however, also other 
messages may appear. The problem is now if or if not this perturbation tends to zero under iteration of the 
2RSB equations. The stability analysis thus amounts to see if a change in one message (called a bug in To^) 
can propagate through the whole system, or if it remains localized. We will refer to this instability, following 
the terminology of |19|| , as bug proliferation. 



A. TYPE-I instability: Aggregation of states and noise propagation 



Let us start with noise propagation. The stability can be computed from the Jacobian 



(23) 

IRSB 



which gives the infinitesimal probability that a change in the input probability rj^ will change the output probability 
riQ, cf. Eq. I|22|) . The index IRSB says that the expression has to be evaluated at the IRSB solution found within the 
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cavity approach. Note that we need to calculate this matrix only for 1 < cr, r < g since the probability of the zero- 
message follows by normalization and thus does not describe an independent quantity. After one iteration, a change 
of one input message of a vertex of degree d induces a change in d — 1 outgoing messages. The global perturbation 
after n iterations thus concerns on average (X^d drd)^ cavity messages, where rd is computed from Eq. Q). To monitor 
also the strength of the perturbation, we have to calculate 



rfifdi • ■ ■ dnrd„ tr((TiT2 • • • T„ 



(24) 



where Ti, ...,T„ are n successive T-matrices. The notation (•)^ denotes the average over the external messages to the 
nodes 1, To be more precise we should write Ti — Td^irji^i, ...,rii^di)i i-e. Ti depends on di incoming probabilities 
rji^i, ...,r]i^di, with di being the excess degree of vertex i distributed according to rdi- For i = 1, all messages are 
external. They have to be generated independently from the IRSB order parameter p{r]). For i > 1, the first message 
described by rji^i results from node i ~ 1 according to Eq. (|22ll . The other inputs 77^,2, Jyi.di are again external 
and thus independently distributed with p{r]). Note that the ?7-averaged matrix ((TiT2...T„)^)^ still depends on the 
random excess degrees di, c?„. This has to be taken into account in the sum in H24|) . 

A simple way to calculate the trace for n ^ 1 is to consider the biggest eigenvalue of the matrix ((TiT2...r„)^)^, 
which allows us to use a single number to follow the perturbation. This, however, becomes very simple for q-COL. 
Let us rewrite the Jacobian: 



(25) 



IRSB 



Evaluated at the color-symmetric IRSB solution, this matrix has only two different entries: All diagonal elements are 
equal, and all non-diagonal elements are equal. As an immediate consequence all Jacobians commute and are thus 
simultaneously diagonalizable. The matrix T has only two distinct eigenvalues. 



Ai 



A2 



dril 



IRSB 



(26) 



IRSB 



The second eigenvalue belongs to the homogeneous eigenvector (1,1,..., 1). It describes a fluctuation changing all 
rjl, T — 1, q, by the same amount, i.e. a fluctuation maintaining the color symmetry. The first eigenvalue is (q— 1)- 
fold degenerate, eigenvectors are spanned by (1, —1, 0, 0), (0, 1, — 1, 0, 0), (0, 0, 1, — 1). The corresponding 
fluctuations explicitly break the color symmetry, and they are in fact found to be the critical ones for the instability 
of the IRSB solution. 

For zero-energy ground states in the COL phase, where y = 00, one may proceed using the closed analytical 
expression (|15ll for the iteration. It is possible to explicitly write down both eigenvalues. We will see in section IVTI 
how to derive asymptotic results at large q. Finally, to monitor the perturbation, one should consider the dependence 
of 



A 



TYPE-I 



(n) 



E 

di,.. . 



dlVd, ■ --dnTd^ ^({Ti-T„y)r, 



(27) 



on the number n of iteration steps. There ^((TiT2...T„Y)r, denotes the biggest eigenvalue of matrix ((TiT2...T„)^)^. It 
can be computed as a product of eigenvalues being either all of type Ai or all of type A2. The IRSB solution is stable 
against TYPE-I perturbations if and only if XTvPE-iin) decays to zero in the large-n limit. 



B. TYPE-II instability: Fragmentation of states and bug proliferation 



As discussed above, we have to consider bug proliferation in order to study the second type of instability, i.e. the 
instability with respect to fragmentation of states into clusters of states. Suppose that a message of type cr is turned 



9 



into another message a. This is called a bug in We suppose it to happen with small probability n'^^'^ <C 1. As 
a consequence, some output messages may change, and the bug propagates. The system is unstable with respect to 
TYPE-II perturbations if such a bug propagates through the whole system. 

Considering small perturbations, we can work in the linear response regime. For a generic change of the first input 
message in Eq. (|22|) . we may calculate the probability of changing the output message: 

TT^^^^Co J2 ^^^vT-'-vT ey^'^^-"'--""^ (28) 

(cr,<T2,...,<Td)-*T 

(cr,cr2,...,<Td)^T 

This defines a matrix V with entries 

V - - (29) 

evaluated at the IRSB solution. Since a message may have q + I different states and we are considering actual 
changes in messages, is a square matrix of dimension q{q+ 1). We study it in Appendix IXI and show that its biggest 
eigenvalues can be written, at any given y, as a function of q and rji, rj^i as 

A = (g - 1) Aiq, M) + ^Biq, {77}) B'iq, {77}) (30) 

where A, B and B' are explicitly given in Eq. (jA4|) in App.fXl In the zero-energy case, i.e. for y 00, this expression 
simplifies to 

X={q-l)Ay=oo{q,{v}) (31) 

where Ay=ac{q, {rj}) is given by 



Ero('i)'e7')nt2[i-a+2)r/.: 



The instability with respect to bug proliferation has to be determined from a product of n such matrices. Prolifer- 
ation is monitored, using again Eq. Q), by 

dird^ ■ ■ ■ d„rd^ tr{ViV2 ■ ■ ■ Vn)rj (33) 

or, using the eigenvalue notation, by 

\TYPE-ii{n)= ^ diTd^- ■ -dnTd,, \{ViV2-v^)^ (34) 



where A^y^y^ . y^)^ is the biggest eigenvalue of matrix {ViV2.--Vn)ri- The averages are performed in complete analogy 
to averages in the TYPE-I case. The IRSB solution is stable if and only if this eigenvalue decays to zero for n ^ oo. 



IV. STUDY OF THE TYPE-I INSTABILITY: NOISE PROPAGATION 



Using the criteria derived in the previous section, we can now study the stability of the IRSB solution for the ground 
states of the coloring problem, i.e. we concentrate first on y ^ oo. We will, in the following, call cgp connectivity 
beyond which IRSB is TYPE-I unstable because, as we will see, the SP equations do not converge anymore on a 
single graph for c > cgp. 



A. Regular random graphs 



In the case of fixed-connectivity graphs all sites are equivalent, so 77 does not fluctuate from edge to edge. Therefore, 
the recursion equation (j2()|l can be simplified to 

,^ _ ... _ Ero^(-i)'(V)[i-a+i)# .35. 
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with k = c — 1. This equation can be easily solved using basic numerical tools. Similar simplifications arise in the 
computation of the complexity S. Within the IRSB formalism, it is therefore very easy to derive r](q, c), to compute 
the corresponding complexity, and thus to determine if the graph is in the COL phase or not. One finds that that 
at small connectivities the solution is trivial, i.e. rj = 0. For c > Cd the clustering RSB phenomenon occurs, and for 
c> Cq the graph becomes uncolorable. To test the validity of the IRSB ansatz we are now going to apply the criteria 
derived previously. Due to the fixed vertex degree, there is no need to average over the site distribution. The stability 
criterion thus simplifies considerably to 

In Tab. we summarize the values of rj and S, as well as stability criteria, for q — 3, A and 5. The instability of 
TYPE-I appears at high connectivity, in the UNCOL phase. So it turns out to be irrelevant in the COL phase. This 
turns out to be true for arbitrary number q of colors, see Sec. IVII for the asymptotic case. However, this instability 
is directly relevant for the behavior of the SP algorithm on single graphs: When c > csp, SP stops to converge on a 
single graph. This is actually what the Jacobian (|23|l implies first of all. We have verified this numerically for 3-COL, 
SP does not converge on regular graphs of degree equal to or larger than 6. 
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UNCOL Instable 


0.1966 


-0.4109 


1.017 



TABLE I: Computation of rj, E and Xtype-i for g = 3, 4 and q = 5 and various values of tlie connectivity on regular random 
graphs 



B. Fluctuating-connectivity random graphs: Erdos-Renyi ensemble 

Let us now turn to Erdos-Renyi random graphs. Sites are no more equivalent and the order parameters rj fluctuate 
from link to link. We have to be very careful in the disorder average in the stability criterion, a detailed description 
was given above. The TYPE-I instability amounts to see if a small change in rj propagates through the whole system. 
We saw in section IIIII that this can be computed by the study of a Jacobian that describes the propagation of a 
perturbation after one iteration. The global perturbation after n iteration is monitored by the sum of the squares of 
the perturbed cavity-bias, which behaves like 

>^TYPE-i{n) = ^ diVdi ■ ■ ■ dnrd„ X{{TiT2---T„)'^)„ (37) 

di,...,dn 

where '^{{Ti---T„y),, is the maximal eigenvalue of the 77-average of matrix (Ti---r„)^. The system is stable if 
^TYPE-iin) goes to zero with n, and instable if it diverges. Let us concentrate for a moment on 3-COL: Using 
the SP equation on single graphs, we saw that the iteration is not converging anymore for c > 5.01. Evaluating 
the stability criterion, we were able to reproduce this number analytically (see Fig|3J). Both methods agrees exactly. 
We have summarized our result for csp in the last column of Tab. IIIII Again, the conclusion is that the TYPE-I 
instability is irrelevant for the thermodynamics of the system. 
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FIG. 3: Analytic and numerical computation of the location of the TYPE-I instability point in the 3-coloring on fluctuating 
connectivities graphs. We find c(3)sp ~ 5.01 in both cases, which confirms well that SP equation stop to converge when 
c > CSP- 

*J?S»* RSB 

* msB * 

-t Gol Uncol 3- 

FIG. 4: Schematic phase diagram for the coloring transition at e = 0. At small connectivity, when c < Cd the problem is COL 
and the space phase is trivial. Then, for c > Cd the phase space is broken into many states and RSB appears. In this RSB 
phase there is a one-step RSB zone Cm < c < csp in which detailed computations are possible. The COL/UNCOL transition 
appearing at Cq is always found in this zone. Note that for some connectivities, Cd and Cm are equal. 



V. STUDY OF THE TYPE-II INSTABILITY: BUG PROLIFERATION 



We turn now to the instability of TYPE-II. Like before, we start with regular random graphs, and then we move 
to Erdos-Renyi graphs. In (l^, it is argued that, if IRSB states are instable, then these states should be described 
using a full RSB ansatz, and therefore are marginally stable |3It| . The connectivity at which the instability appears 
was therefore called Cm for Cmarginai- Although it is not yet clear if full RSB is needed in the case IRSB is unstable, 
we follow this notation and call Cm the connectivity at which the TYPE-II instability shows up. 



A. Regular random graphs 

Using again the homogeneous solution, it is very easy to write the stability criterion. It reads (see appendix IB)| 

^TYPE-II = - 1) w „ wi 7, — TTIT- < 1 (38) 



with k c— 1. Applying this criterion, it turns out that the instability does not appear at low q. In fact, for q < 6, the 
ground state in the COL region is always either RS or IRSB, no instability toward 2RSB is observed. However, when 
g = 6, the lowest connectivity for which RSB appears turns out to be instable. More generally, there is an unstable 
zone growing with q before the stable IRSB region is reached. We summarize our result for small q in Tab. |nj Note 
that Cd is calculated in IRSB approximation, if the latter is instable its position may change due to more-step RSB. 

The one-step solution is thus not always stable in COL phase. It seems, however, to be correct close to the 
COL/UNCOL threshold, the result for the latter is therefore conjectured to be exact. The global situation is schemat- 
ically represented in the Fig.0| We have derived the solution of this problem up to g = 200, which allows us to draw 
a quite complete phase diagram for the problem on finite connectivity random graphs, see Fig. \S\ From this phase 
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12 


46 


48 


57 
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TABLE II: Critical connectivities of the coloring problem on fixed connectivities random graphs. 




FIG. 5: Phase diagram of the q-coloring problem on random graphs with fixed connectivities, for 3 to 200 colors. From bottom 
to top we have four line corresponding to Cd, Cm,Cq and csp. The critical line Cq that separates the COL and the UNCOL 
region is always in the stable zone, where we conjecture our results to be exact. Note however the small, but slowly growing, 
zone in the COL region where the IRSB computation is unstable. 



diagram, two important points have to be noticed: 

• The critical line for the COL/UNCOL transition is always in the IRSB stable zone, and therefore we believe it 
to be be an exact result. 

• There are two zones where the IRSB solution is unstable. This is happening at high connectivity in the UNCOL 
phase, due to a TYPE-I instability, and also in a small, though growing (with the number of colors) zone between 
the RS COL and the IRSB COL phases, due to a TYPE-II instability. 



B. Erdos-Renyi random graphs 

Let us now go back to the Erdos-Renyi ensemble. As mentioned above, the instability is monitored by 

where the di has the characteristic Poissonian distribution r^. of average c. As already explained above, the 77-values 
are given in the following way: For the first node i = 1, all values of 771,1, rji^di drawn independently from p(r]). 
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FIG. 6: Computation of the location of the TYPE-II instability point in 3-COL on Erdos-Renyi graphs. One finds Cm ~ 4.51. 

This allows to calculate the first factor in the above product, but also the first input message 772,1 = ■■■tViAi) 
to the second node. The other (i2 — 1 input messages 772,21 ■■■,V2,d2 drawn again from p(77). This is repeated for 
the other vertices: The input message on the link coming from the previous node is induced via the SP equation, 
the others are drawn randomly. Using again the population dynamic solution of the coloring problem, it is easy to 
perform this average. For 3-COL, e.g., we find the instabihty to be present for connectivities c < c,„ ~ 4.51, see 
Fig. ini) We have repeated this study for g = 4 and 5. Our results are summarized in Tab. IIIII As opposed to what 
happens in the fixed-connectivity case, there is an unstable interval close to the clustering transition at g = 3,4. For 



q 


Cd 


Cm 


c. 


CSP 


3 


4.42 


4.51 


4.69 


5.01 


4 


8.27 


8.34 


8.90 


10.21 


5 


12.67 


12.67 


13.69 


17.1 



TABLE III: Critical connectivities of the coloring problem on Erdos-Renyi graphs. 

q — 5, however, it is always stable. As we will see in the next section, at larger connectivities the unstable zone 
re-appears. Again, the most important conclusion is that the critical values for the COL/UNCOL transition are in 
the stable zone. We thus conjecture that the connectivities for the COL/UNCOL transition are exact. 

VI. ASYMPTOTIC THRESHOLD VALUES IN THE HIGH-g LIMIT 

Whereas the precise values for transitions and stability regions of the IRSB approach have to be determined by 
numerically solving the cavity equations, and by plugging this solution into the stability criteria, an asymptotic 
analysis can be carried through analytically in the limit of a large number of colors, i.e. for 5 3> 1. 

As an astonishing result, Poissonian and regular random graphs have the same asymptotic behavior, at least what 
concerns the dominant contributions. Degree fluctuations play a role only at low-order contributions which, due to 
the complexity of their determination, are not considered here. To lighten the presentation, we consider therefore 
only the case of regular graphs, i.e. of equal vertex degrees c = fc -I- 1 for all vertices. 

As discussed above, in this case the IRSB solution is factorized and thus completely determined by a single edge- 
and color-independent 77. A basic numerical observation in this context is that, for q ^ 1, this 77 approaches 1/g, 
whereas the probability jf^l — qr] of sending a trivial message vanishes asymptotically. We can therefore write 
qr] = 1 — g{q,c), where the positive, but still unknown function g{q,c) will be shown to vanish in the large-g limit. 

Note that, in the case of fluctuating vertex degrees, also 77 will fluctuate according to some non-trivial pij]). For 
large q, this distribution concentrates, however, more and more sharply around its average value. A careful analysis 




14 



of the influence of these fluctuations shows that they contribute only to sub-dominant terms in the threshold values, 
i.e. they can be neglected to the considered orders in q. 



A. The clustering transition in IRSB approximation 



The clustering transition is characterized by the onset of a non-trivial solution of the survey propagation equation 
l)19|l . To determine this point, we consider a graph of constant degree c = k + 1 with 



k = q[lii q + Inln q + a] 



(40) 



There a is an arbitrary constant, i.e. in the largc-g limit k is determined down to 0{q). Doing so, we zoom directly 
into the region where the dynamical transition appears: We look for an ad at which the first non-trivial rj appears as 
a solution of 



(41) 



As a first step we realize that, for ~ l/q and k as given in 140() . the sums in both the numerator and the denominator 
are dominated by values oi I ^ q, where we can replace 



[1 - If]]'' ~ exp{-?7A:/} 



exp{— [1 — g{q, a)] [In q + hihiq + a] 1} 



1 



qlnq 



(42) 



up to neglectable corrections. Note the change in notation from g{q, c) to g{q, a). Plugging this into Eq. H41(l . we find 



1 - 5(9,") - 1 — 9 
mq 



exp{-j^ e-"} 



2\nq 



This equation closes for g{q,a) = 7(a)/ In g, with the condition 

7(a)e-^(") = ie" 



(43) 



(44) 



The maximum of the left-hand side is 1/e for 7 = 1, i.e. a real solution for 7(0;) exists if and only if a > 1 — In 2. We 
thus find the dynamical transition in the IRSB approach at 



Cd = q [lng + lnlng+l-ln2 + o(l) 



(45) 



In q 



+ o{\nq 1) 



The result is equally valid for Poissonian random graphs. This can be understood immediately: Degree fluctuations 
are of 0[^/c) = 0{^Jqhiq), and they are thus neglectable compared to the contributions in Eq. 1^01) . 



B. The COL/UNCOL transition in IRSB approximation 

From the upper bound of Luczak and the lower one of Achlioptas and Naor we know that 

Cq-^2q\nq-\nq + 0{l) . (46) 

We are going to re-derive this result from the cavity equations, and we also determine the previously unknown 0(1)- 
contribution. Observing that, under this scaling of the degree, the contributions in the self-consistency equation 141f) 
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behave as [1 — Irj]'' ^ q we find 



- 1 - 77 (9 - 1) cxp <^ -krj - -krj 




This equation is solved by 



1 
V 



3 Inq 
2g3 



0{q- 



independently on the C'(l)-terni in H4t)|l . This value has to be plugged into the expression for the complexity, 



E = ln^(-1) 



" J J[l-a + l)r;]M -^ln(l-g,7') 



(47) 



(48) 



(49) 



in order to determine the critical point Cq of the COL/UNCOL transition from the vanishing of S. Keeping only the 
three dominant terms in q, we immediately find 



Cq ~ 2(7 In g — In (7 — 1 + o(l) 



(50) 



The result is equally valid for Poissonian graphs, and it coincides precisely with the upper bound of Luczak, i.e. his 
improved annealed approximation is asymptotically exact. It is, however, also only one larger than the conjectured 
lower bound by Achlioptas and Naor. In this way we see that the IRSB approach is not only consistent with the 
mathematical bound, but allows for a more precise (even if not rigorous) determination of the threshold value. 

C. Asymptotic stability 

To check the validity of the above results, we have to certify that the IRSB solution is locally stable. Again we 
discuss only the case of the regular random graph of degree c = fc + 1 , but the results do not differ in the Poissonian 
case. 

1. Instability of TYPE-I 

Let us first start with the instability of TYPE-I. To do so, we should compute the eigenvalues of the stability 
matrix derived in Sec IIVI For average connectivities c 3> g, the recursion equation l|15(l is dominated by the first 
contribution in both the numerator and the denominator, leading to p(r]) = S(j] — l/q) in leading order. Corrections 
are exponentially small in c/q and can thus be neglected. More precisely, taking the asymptotic c ^ q results of 
Eq. H14() and then using the fact that all colors on all branches except the perturbed one share the same 77, one obtains 



(1 - 7?[) (1 - r,) 



fe-i 



(51) 



We immediately compute the derivatives I 'Wr^\ri'^=ri form the entries of the Jacobian matrix T given 
in Eq. (jSSJ: 



(52) 



Using eigenvalues H2t)f) , we finally find for q 



1 

' q-l 



(53) 



=v 
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Therefore, the stability criterion reads 

< ' ^''^ 
such that the instabihty appears at connectivity greater than 

Csp^{l-q)^ + l-q^ -2q + 2 + o{l) . (55) 

In fact, this formula gives very good results even at small q for both Erdos-Renyi and regular random graphs. The 
reason why it is working so well is that we are considering connectivities growing like q^, so the condition c S> (? is 
satisfied very fast. 

The dominant contributions give always an integer csp- In particular in the case of regular graphs it is therefore 
important to know at least the sign of the next-order term. Numerically we find this to be positive for arbitrary q. 
Since we define csp as the first unstable connectivity, we should therefore write 

c^r" = {q- 1)' + 2. (56) 

This formula amazingly appears to be exact even for q — 3. Note that, due to its q^ dependence, as compared to 
the order q\ogq of the g-COL/UNCOL transition, this instability is always located in the UNCOL phase and is thus 
irrelevant to the physics of the problem. It is however useful since it tells us when the SP equations at y = oo stop 
to converge on a single graph. The finite q result is also extremely well approximated by the q limit in the case 
of fiuctuating connectivities, giving 

cf« = l + (g-l)2 + o(l). (57) 



2. Instability of TYPE-II 

Finally, we have also to check if a TYPE-II instability exists. To do so, we start our investigation close to the 
clustering transition, i.e. we go back to the scaling 

k = q[\nq + lii\nq + a] . (58) 

As shown before, this results in 

1 



v = - 
q 



l--^ + o(lng-i) 
In q 



(59) 

where 7 is the smaller of the two solutions of 276^''' = e^". The stability criterion (|H8|I reads 

A..,.-„^%-i)^"'r"''7')''-"+^""""<i^ (60, 

E?;„(-i)'Wi)[i-(' + i)>ii' 

Plugging in Eqs. H58() and (|59|l . and keeping only the leading order terms, we find 

XTyPE-n = e^- + o{^^^ (61) 

For sufficiently large g, this becomes smaller than one if and only if e'*'^" < 1 which, according to the condition for 7, 
holds for all a > 1/2. We therefore conclude that IRSB is stable for all connectivities larger than 



Cm q 



lnq + ln\nq + i + o(l) 



(62) 



This value is slightly larger than the one of Cd, i.e. there is a linearly growing gap between the onset of a non-trivial 
IRSB solution and its stability. Note, however, that the relative distance of both expressions is decreasing, the two 
leading terms in and c„i coincide. 

It is very interesting to compare this behavior with the best lower bounds obtained from heuristic algorithms. As 
discussed in more detail in the introduction, these have the leading term qlnq. This results supports the intuitive 
feeling that local linear-time algorithms are not able to enter the clustered phase due to the proliferation of metastable 
states of non-zero energy. It would be interesting to understand better the influence of the structure of the energy 
landscape on the behavior of local algorithms, because this interplay between static and dynamic features may be a 
useful tool in systematically improving algorithms. 

Note in this context also that the clustered phase covers half of the COL phase for large q. This means that, 
compared to, e.g., 9 = 3, the influence of the clustering phenomenon becomes asymptotically more and more important. 
In this sense, a full understanding of the clustering phenomenon beyond the IRSB approximation is of crucial interest. 
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FIG. 7: Complexity IRSB and its IRSB stable portion for the 3-coloring problem on graph with fixed connectivity c = 5 (left), 
an for the the 4-coloring at c = 9. Both system are in the COL Phase, and ground state, as well as low energy states, are 
described by a IRSB ansatz. However, at high energy, when e > e™, excited states should be described by a different ansatz, 
probably F-RSB. 



VII. THE FINITE ENERGY PHASE DIAGRAM 



It is very interesting to consider also finite-energy states, both for physical and computer science motivations. 
Indeed, the nature of these states and their degree of RSB provide some crucial information for the physical picture of 
the model and for its finite-temperature phase transition. Thermodynamically, the anti-ferromagnetic Potts model on 
random graphs behaves in fact like a Potts glass [solfi^ and it is thus interesting per se. For instance, one would like 
to know if, depending on the number of colors and on the connectivity, the system behaves like a p— spin model and 
has a IRSB transition, or like the Sherrington-Kirkpatrick model which has a continuous transition. From the point 
of view of combinatorial optimization, it is also widely believed that the time dependence of local search algorithm, 
and the performance they may reach, is also related to the structure of the energy landscape 36, 37, 38] . It has been 
shown very recently that it is possible to predict a threshold where a slow annealing will end |lj| using the statistical 
physics information on the nature of excited states. 

The basic question we will answer in this section follows from 13], where the complexity S(e) where computed 
using the IRSB solution. We know now that the IRSB solution is in fact not always correct, so we would like to know 
which part of the complexity curve is correct and which part is not. 



A. Regular random graphs 

Let us first consider regular graphs. In Figs. |3 we plot complexity versus energy, i.e. the logarithm of the number 
of states, divided by N, versus their energy, for 3-COL on 5-regular graphs, and for 4-COL on 9-regular graphs. Both 
cases are in the COL phase which implies Y,{e = 0) > 0. We already showed in this paper that the IRSB approach is 
stable for these two cases at e = 0, and therefore the computation of E(e — 0) is correct. 

Extending our work from e = to positive energies, we indeed find that only the TYPE-II instability is relevant in 
the physical part of the phase diagram (i.e. where complexity is non- negative) . Using Eq. H34|) at finite y, we observe 
that a large part of the positive-energy solution is IRSB unstable, i.e. only a small fraction of the complexity curve 
close to the ground state is exact. Therefore, using the notation ec (for Gardner energy 13^) for the highest energy 
for which IRSB is stable, the IRSB computation of the complexity is only valid for energies < e < ec, and should 
be modified for ec < e < e^- 

Two questions are arising. First, how to compute the correct complexity when IRSB is unstable? Of course, it is in 
principle possible to use a 2RSB ansatz, but this is technically much more demanding, and again the stability towards 
3RSB should be tested. It is still a matter of debate how to modify the complexity when IRSB is unstable: The usual 
paradigm is that, if not described by IRSB, then states should be described by a full RSB ansatz (since all known 
IRSB unstable models seem to be described by full RSB). Assuming that this is indeed the case, we have to face the 
question of the complexity of full RSB states, which again is still very controversial. While some authors argue that 
full RSB states have a vanishing complexities, the question remains unsolved and has recently seen a renewed interest 
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FIG. 8: Finite energy phase diagram of the 3-coloring. egs gives the ground state energy density, ea the energy density of the 
IRSB stable states of highest energy, and and the threshold energy in IRSB approximation, i.e. the energy below which a 
IRSB solution exists. 

(see mi). 

Another very important question, using these complexity curves, would be to predict dynamical thresholds, like 
in [l^ . That would be a very natural extension of this work to dynamical studies. 

B. Erdos-Renyi random graphs 

For Erdos-Renyi random graph, we concentrate on 3-COL. Instead of showing further complexity curves, we prefer 
to display, in fig. |S1 a phase diagram which we think contains a good summary of this work. The following points are 
to be commented: 

• First, in full line, we display the ground state energy Cgs versus the average connectivity c. It is zero below Cg, 
in the COL phase, and positive when c > Cq ~ 4.69, i.e. in the UNCOL phase. This has been calculated using 
the energies for which the complexity is exactly zero. 

• The energy ea is the threshold energy, i.e. the energy below which the clustering transition appears, and where 
replica symmetry is broken. In IRSB approximation, it is non zero for c > — 4.42. 

• The energy ec is the Gardner energy, which tell us where the clustering phenomenon becomes IRSB unstable. 
For egs < e < ec, states are IRSB stable, whereas they are unstable for eg < e < e^. This energy line starts at 
Cm (note that Cd < Cm < Cq) and allows us to determine in which zone of this phase diagram IRSB is correct. 
It crosses the ground state energy line at cg — 5.08 (we use again the terminology of stand for Gardner 
connectivity). Therefore, for c > cq, the problem is never IRSB (at least for physical, i.e. positive-complexity 
solutions). 

• Going back to the e = line, we have finally, in the UNCOL phase, the point where the SP equations stop to 
converge on a single graph. This happens for all c> csp — 5.01. 



VIII. CONCLUSION 

In this paper we have studied in detail the limits of IRSB approximation for the g-coloring problem on random 
graph of both fixed and Poissonian degrees. The IRSB approximation shows two different kind of instabilities: a 
TYPE-I instability where the clusters of solutions tend to aggregate, and a TYPE-II instability where solutions inside 
a cluster form further levels of clusterization. In the colorable phase, we find in particular the latter kind of instability. 
Luckily enough, at any 9 > 3, the COL/UNCOL transition is found in the IRSB stable region, making it plausible 
that the values presented in Tabs. HTl and IIIII are in fact the true q-COL/UNCOL thresholds. 
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In the limit of g ^ oo many simplifications can be done in the calculation, allowing for a fully analytic treatment of 
the problem, and also for checking our results against the rigorous ones presented in |5|,(^J. We find the COL/UNCOL 
threshold to be asymptotically Cq = 2q\ogq — \ogq — 1 + o(l) which on the one hand coincides precisely with the 
rigorous upper bound of Luczac 5] , which on the other hand differs only one from the lower bound of Achlioptas 
and Naor |6j|. All these findings are good news for the IRSB cavity approach, that not only turns out to be consis- 
tent with independently established rigorous mathematical results, but also allows for sharper, though not rigorous, 
determination of threshold values. 

There are several lines in which the work presented here could be extended. The most straight-forward direction is 
probably the question of how to implement a 2RSB calculation, in order to understand the phase-space structure in 
the IRSB unstable region, and to see in how far the IRSB approximations for the clustering transition, the complexity 
and the threshold energy have to be changed. 

A second interesting direction concerns the connection between the failure of linear-time algorithms and the onset 
of clustering. Even if a connection between both seems pretty intuitive due to the existence of exponentially many 
metastable states, one should keep in mind that algorithms do not follow a physical dynamics with detailed balance 
etc. The connection between the energy landscape and the configurations explored by the algorithm is therefore far 
from beeing obvious. Even if so far no local linear-time algorithms were found that solve g-COL inside the IRSB 
stable region, their mathematical analysis has no obvious connection to the landscape properties of the model. It 
would therefore be extremely interesting to either establish this connection or to prove its non-existence. 
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APPENDIX A: INSTABILITY OF THE SECOND KIND 

In this appendix we study the eigenvalues of the matrix V defined by Eq. (|29|) , 

- (Al) 

To compute this matrix element from Eq. (|28|l one needs to consider all configurations of messages ((T2, aj) on the 
incoming links 2, 3, .., d such that 

(a) if the incoming warning on link 1 is given by cr, the warning t is induced at the output, and 

(b) if the incoming warning on link 1 is given by ct, the warning f is induced at the output. 

Since we have q + I different messages labelled by 0, q, we have to deal with a {q{q + 1)) x {q{q + 1)) matrix. 

A physical interpretation of recursion © is that, when a site is exposed to a g-component field h = {h^,...,h'^) 
(entries are all negative or zero, reflecting the anti-ferromagnetic nature of the Hamiltonian) then 

(i) if there is a unique maximal field component h'^ , the site will send a message — eV saying "Do not take color r" 
via the outgoing link (in the following we call this a warning of color r), and 

(a) it will send a zero message otherwise. 
Let us study all elements, case by case, first when a zero message is changed to a messenger with a color, then a color 
is changed into another one. 



1. Changing to a colored message (or a color to 0) 



When one changes one input message to a colored one, e.g. — ei, it can be shown that most corresponding matrix 
elements vanish. A simple way to show it is the following: Consider link one, where we are changing the incoming 
message, and the other d — 1 incoming links. In the first configuration, link one is sending a 0, so it has no effect. 
The other d—1 incoming messages induce a field h. If we turn now the message on branch one to — ei, this has the 
effect of decreasing field component by one. The outgoing message is non-zero only if there is a unique maximum 
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field, this change has no effect at all if was not a maximum field component. To obtain non-zero matrix elements, 
we thus have to consider only situations in which this was a maximum field before changing the first message. 
Two cases are arising: 

1) First, if was the only maximum component before the change, then the output was ei in the first configuration. 
Adding a new incoming message of color one decreases by one unit. Now either is still the unique maximum, 
and nothing has changed, or it now equals to another field, say h^, and the new outgoing message becomes 0. 
There is thus a finite probability to change the input from color 1 to by changing an input to color 1. 

2) Second, if was not the maximum field and if they were two such maximum fields, say and h^, then adding 
an incoming message of color 1 forces to decrease, and therefore may become the only maximum and the 
new output is — 6*2- 

There are thus only two non zero terms, that we will call A and B: 

U = i;o..o.. (A2) 
l^o — vi^o,o^i 

All other non-zero terms having a zero input changed to a colored one, result from simple color permutations and 
equal the two described ones. Note that similar equations can be written when one changes an input of color 1 to 0. 
In this case the only non-zero terms are given by 




(A3) 

The values of A,A',B and B' are associated to the two following situations (considering the set of fields h resulting 
from the other d — 1 incoming messages): (a) A{q,{r]}) and ^'(g, {t;}) are probabilities of having and as the 
two only maximum field and (b) B{q, {ij}) and B'{q, {ry}) are the probability of having as the maximum field, and 
that there exists at least one other field with value h — — 1. To obtain the final matrix element one has to put the 
reweighting factor corresponding the to second configuration, therefore we find: 

A{q,M)^A\q,{v}) ^ Co J2 P{h\h\h\..,h'i)ey'^" (A4) 
B'{qAr}}) ^ Co P{h\h\h\..X)e"'^' 

(li^ ,..,h''>h^) 
{h^-l=max{h^,h^,..,h'')) 

B{q,{v}) ^ Co P{h\h\h^..,h'^)ey^^'-^'> 

[h^ ,..,h''>h^) 
{h^ -l=max{h^ ...Ji")) 

where we have introduced, following [13i] . the notation P{h) as the probability of having a configuration of messages 
that gives a set of fields h before any reweighting is done. Thus P{h) would result from Eq. H17|) by setting y — Q. 
This notation will be of great use in the computation of the appendix. Note that, to compute P{h) we consider here 
only messages arriving from the d — 1 unchanged links 2, 3, ...d. 

Finally, another very important property for the structure of the matrix is that, by changing to ct, one cannot 
change a color r to another color f in the output. This will turn out to considerably simplify the problem. 



2. Changing one color to another color 

We now consider a change of the first incoming message from color 1 to color 2. One can show, using similar kind 
of reasoning, that Vi^2,i^2 = V'3^2,1^2 = ^1^3,1^2 = ^2^3, 1^2 = V"3^i,i^2 = and that ^2^1, 1^2 = A. Having in 
mind these relations, we can now write the stability matrix. 
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3. The stability matrix V 

We write the stability matrix V in a way that justifies why we did not care about some terms in the previous 
section. In the base we chose, the matrix is block triangular, so we will need only to care of diagonal block matrices 
to compute eigenvalues (since, in a triangular block matrix, eigenvalues are eigenvalues of the diagonal matrices) . We 
write 



V 





(0,e) (6,0) 


(e,6') 




'(0,ei) 






•1 


(0,62) 








.(0,e,) 
'(ei,0) 


iVi 


VV 


•1 


(62,0) 








(61,62) 








(62,63) 






9(9-1) 


(63,62) 
^(62,61) 





z 



(A5) 



where M and Z can be written 
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4. Eigenvalue analysis 

We are now ready to find the biggest eigenvalues of the matrix V, which will be the biggest of all eigenvalues from 

2 _ 2 _ 

Z and M. The matrix Z has ^ ^ eigenvalues —A, and ^ ^ eigenvalues A. Eigenvalues of M can be easily studied 
using again its block matrix structure; the biggest one is found to be (q — 1)A + \fBB' . Thus the biggest eigenvalue 
of the whole stability matrix is 



A = (q - l)^(g, {^\) + ^/Bi^MW^M) 



(A6) 



where A, B and B' are defined by Eq. ljA4|l . In the y — 00 case, new simplification arises because B is associated with 
contradicting messages and it is annihilated by its reweighting factor. Thus 



Ay^oc- = (9 - l)A=oo(9, {v}) 



where Ay=oo{q, {v}) be explicitly computed (see Appendix^ and reads 



(A7) 



Ay^ooiqAv}) 



nl2[i-(^+2)^d 



ELo(-i)'Gynti[i-(^ + i)^.] 



(A8) 
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APPENDIX B: INSTABILITY OF THE SECOND KIND AT y = oo 

Let us concentrate on the colorable phase, where the ground states are characterized by ?/ = oo. Here we give an 
explicit computation of Ay=ao- Let us concentrate for the derivation for 3-COL. Following the notation of App.^ we 
denote P{h) the probability of having a configuration of d warnings that sum up to the field h without reweighting. 
It has no direct physical meaning but it is of great technical help in the present computation, cf. 13] . 

We first need to calculate the value of the normalization constant Co in the y oo limit. Since reweighting is 
killing any term with positive energy shift, the only surviving terms in the recursion are those where all fields have at 
least one zero component, allowing for the selecting of at least one color without violating an edge. The normalization 
factor thus reads 

^=F(0,0,0) + 3^ P(/i\0,0) + 3 Pih\h\Q) (Bl) 

where the combinatorial factors 3 appearing in the r.h.s. are obtained by noting that P{h, 0, 0) — P{Q, h, 0) = P{Q, 0, h) 
and that P{h\h'^,0) = P{h^,Q,h?) = P{0,h\h^). 

Now, we need to compute the expression for A from Eq. I)A4|I . summing this time only over the d — 1 incoming 
warnings 2, 3..., d in the computation of P in the numerator 

A{q,{v})^Co P{h^=0,h^=0,h^..,h'^) (B2) 

{h=^,..Mi<h^ = h^=0) 

Therefore, A{q, {??}) is easily computed once the expression of P for a given number of d branches are known. Using 
the cavity recursion equations, one can show that, when summing over d neighbors 

(B3) 
(B4) 



P(0,0,0) 
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- Mi) 




h'^<0 
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- 2ry0 


k k 

- P{0, 0, 0) = - 2770 - 11(1 - 3r;,) 

1=1 1=1 


h\h^<0 


k 

i=l 


- Tit) - 


~2 E P(/i\0,0) -P(0,0,0) 




k 

= na 

i=l 


- Til) - 


k k 

-2[](l-2r;,) + n(l-3^0 

i=l i=l 



(B5) 

Using now these relations, with the proper product over the ti or d — 1 incoming messages, we immediately get from 
Eq. (112) 

3nj=i(i - vi) - 3n,=i(i - 2»7») +ni=i(i - 

This equation can be easily generalized to an arbitrary number q of colors, and we find 

EL-o^(-iy(4i)nti[i-a+ih.] ^ ^ 
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